submitted by /u/Illustrious_Row_9971
[link] [comments]
( 111
min )
https://youtu.be/1gHUiNLYa20
This video explains and summarizes the 57 pages long "Building Machine Translation Systems for the Next Thousand Languages." paper from Google Research. It goes into the data collection, modelling processes and a bit into the results.
Paper link: https://arxiv.org/abs/2205.03983
Outline:
00:00 Machine translation for a 1000 languages
00:42 Weights&Biases (Sponsor)
02:00 Problems with many languages
04:15 Collecting data for 1k languages
11:46 Building MT models
14:13 Results on a thousand languages
submitted by /u/AICoffeeBreak
[link] [comments]
( 87
min )
submitted by /u/ajcvedia
[link] [comments]
( 90
min )
submitted by /u/gwern
[link] [comments]
( 86
min )
submitted by /u/ai-lover
[link] [comments]
( 86
min )
submitted by /u/gwern
[link] [comments]
( 86
min )
submitted by /u/pixelz_ai
[link] [comments]
( 86
min )
submitted by /u/kbf_
[link] [comments]
( 86
min )
submitted by /u/pwillia7
[link] [comments]
( 86
min )
submitted by /u/getrich_or_diemining
[link] [comments]
( 86
min )
submitted by /u/Due-Ad9795
[link] [comments]
( 86
min )
submitted by /u/prfitofthesngularity
[link] [comments]
( 86
min )
submitted by /u/tohelpyou88
[link] [comments]
( 86
min )
submitted by /u/Miffyli
[link] [comments]
( 86
min )
submitted by /u/gwern
[link] [comments]
( 117
min )
submitted by /u/JoshGrambo
[link] [comments]
( 86
min )
submitted by /u/mattsparkes
[link] [comments]
( 86
min )
submitted by /u/sopadebombillas
[link] [comments]
( 86
min )
Data mining : Linkedin Profile Scraper integrated with Language recognition to assign profile grades - YouTube
Based on the keyword provided the software will search for profiles and assign scores, It does around 50profiles per minut so It can check automatically 3000 profiles an hour and assign a score to each profile based on the keywords loaded.
submitted by /u/Tomislav23
[link] [comments]
( 89
min )
submitted by /u/1024cities
[link] [comments]
( 86
min )
submitted by /u/ai-lover
[link] [comments]
( 87
min )
submitted by /u/prfitofthesngularity
[link] [comments]
( 86
min )
submitted by /u/Available_Tadpole829
[link] [comments]
( 86
min )
Decision Tree Algorithm, Support Vector Method Algorithm, Logistic Regression, K-means Clustering Algorithm, and Naïve Bayesian…
( 14
min )
submitted by /u/BasicallyJustASpider
[link] [comments]
( 86
min )
AI Weirdness: the strange side of machine learning
( 2
min )
“Interpretability methods” seek to shed light on how machine-learning models make predictions, but researchers say to proceed with caution.
( 9
min )
PROOF: https://i.redd.it/2z42nlnbssc91.jpg
We’re part of the team behind Meta AI’s latest AI breakthrough in machine translation with our No Language Left Behind (NLLB) project. It’s a translation system that can support over 200 languages, even if there isn't a lot of text available to learn from. The reality is that a handful of languages dominate the web meaning only a fraction of the world can access content and contribute to the web in their own language. We want to change this by creating more inclusive machine translations systems – ones that unlock access to the web for the more than 4B people around the world that are currently excluded because they do not speak one of the few languages content is available in. Here are a few things about NLLB we’re excited for:
Latest breakth…
( 131
min )
Domain knowledge can sometimes boost model performance significantly. I have used knowledge graph in some of my projects as pre/post step to improve model performance. But that adds to deployment complexity. Theseus is good step towards end to end AI models that incorporate domain knowledge. It is library for differentiable nonlinear least squares (NLS) that is particularly useful for applications like robotics and computer visions.
Read more: https://ai.facebook.com/blog/theseus-a-library-for-encoding-domain-knowledge-in-end-to-end-ai-models/
submitted by /u/ashwan1
[link] [comments]
( 87
min )
submitted by /u/Repeat-or
[link] [comments]
( 86
min )
submitted by /u/ai-lover
[link] [comments]
( 87
min )
submitted by /u/widgia
[link] [comments]
( 86
min )
submitted by /u/kbf_
[link] [comments]
( 86
min )
Artificial Intelligence for Business Leaders Webinar
Join Professor Pedram Mokrian to learn how business leaders should think about developing AI solutions. Learn key AI terms, trends, and concepts that inform business strategy. Register for webinar.
submitted by /u/Stanford_Online
[link] [comments]
( 86
min )
submitted by /u/Available_Tadpole829
[link] [comments]
( 86
min )
submitted by /u/Eth_ai
[link] [comments]
( 86
min )
submitted by /u/widgia
[link] [comments]
( 90
min )
submitted by /u/Lozmosis
[link] [comments]
( 90
min )
submitted by /u/joanna58
[link] [comments]
( 86
min )
The process of building a machine learning (ML) model is iterative until you find the candidate model that is performing well and is ready to be deployed. As data scientists iterate through that process, they need a reliable method to easily track experiments to understand how each model version was built and how it performed. […]
( 10
min )
South Korean startup Lunit, developer of two FDA-cleared AI models for healthcare, went public this week on the country’s Kosdaq stock market. The move marks the maturity of the Seoul-based company — which was founded in 2013 and has for years been part of the NVIDIA Inception program that nurtures cutting-edge startups. Lunit’s AI software Read article >
The post Shifting Into High Gear: Lunit, Maker of FDA-Cleared AI for Cancer Analysis, Goes Public in Seoul appeared first on NVIDIA Blog.
( 6
min )
Epic Games is bringing a new Fortnite reward to GeForce NOW, available to all members. Drop from the Battle Bus in Fortnite on GeForce NOW between today and Thursday, Aug. 4, to earn “The Dish-stroyer Pickaxe” in game for free. Members can earn this item by streaming Fortnite on GeForce NOW Read article >
The post Get Battle Ready With New GeForce NOW Fortnite Reward appeared first on NVIDIA Blog.
( 5
min )
Thanks to earbuds you can have calls anywhere while doing anything. The problem: those on the other end of the call hear it all, too, from your roommate’s vacuum cleaner to background conversations at the cafe you’re working from. Now, work by a trio of graduate students at the University of Washington who spent the Read article >
The post Researchers Use GPUs to Give Earbud Users a ‘Mute Button’ for Background Noise appeared first on NVIDIA Blog.
( 5
min )
Business approach is changing constantly to all creative possibilities provided by the digital revolution. The big bang of the digital…
( 11
min )
submitted by /u/Weak_Individual_2010
[link] [comments]
( 86
min )
submitted by /u/tohelpyou88
[link] [comments]
( 86
min )
submitted by /u/nalr00n
[link] [comments]
( 86
min )
submitted by /u/zestysnacks
[link] [comments]
( 86
min )
submitted by /u/much_successes
[link] [comments]
( 86
min )
submitted by /u/techn0_cratic
[link] [comments]
( 91
min )
submitted by /u/LordPewPew777
[link] [comments]
( 86
min )
submitted by /u/OnlyProggingForFun
[link] [comments]
( 86
min )
I was googling around as I'm starting to get interested in AI and these videos came up in the search, I clicked on it despite thinking it was going to be low value but am now intrigued as to what they are for!
Any Ideas?
https://www.youtube.com/watch?v=RfNtuHQ42v8
submitted by /u/timjwes
[link] [comments]
( 86
min )
submitted by /u/aremstudio
[link] [comments]
( 91
min )
submitted by /u/mergisi
[link] [comments]
( 86
min )
submitted by /u/Available_Tadpole829
[link] [comments]
( 86
min )
submitted by /u/the_anonymizer
[link] [comments]
( 86
min )
OpenAI blog post.
How DALL·E Credits Work.
Links to DALL-E Content policy and Terms of use, along with older archived versions.
submitted by /u/Wiskkey
[link] [comments]
( 90
min )
🍰 Slice of ML 🍰
Hi folks,
We recently built out a nice CLI that allows you to fetch the top ML tweets of the day/week! You can see a demo in the video above.
You can check out how we built it here & check out the repo here!
submitted by /u/BlockDesigns
[link] [comments]
( 87
min )
Geometric Deep Learning approaches a broad class of ML problems from the perspectives of symmetry and invariance, providing a common blueprint for neural network architectures as diverse as CNNs, GNNs, and Transformers.
In a new series of posts, we study how these ideas have taken us from ancient Greece to convolutional neural networks.
Blog post link.
submitted by /u/hardmaru
[link] [comments]
( 87
min )
submitted by /u/tohelpyou88
[link] [comments]
( 91
min )
As new data privacy regulations like GDPR (General Data Protection Regulation, 2017) have come into effect, customers are under increased pressure to monetize media assets while abiding by the new rules. Monetizing media while respecting privacy regulations requires the ability to automatically extract granular metadata from assets like text, images, video, and audio files at […]
( 10
min )
Large-scale models are revolutionizing deep learning and AI research, driving major improvements in language understanding, generating creative texts, multi-lingual translation and many more. But despite their remarkable capabilities, the models’ large size creates latency and cost constraints that hinder the deployment of applications on top of them. In particular, increased inference time and memory consumption […]
The post DeepSpeed Compression: A composable library for extreme compression and zero-cost quantization appeared first on Microsoft Research.
( 16
min )
Gaming has moved from a niche sector to the mainstream. Games have become a part of everyday lexicon like never before, and the technological progress evident within game UIs has played a role. The gaming landscape is highly diverse.
The post 4 Ways AI is Shaping the Future of Interactive Games appeared first on Data Science Central.
( 19
min )
Introduction Portable radio communication devices like walkie-talkie radios have supported security services for ages. Radio communication first gained traction during World War 1 when the military used Walkie-Talkie Radios exclusively to stay connected with their troops. Cut to today, we see security agents who are in charge of protecting people or property, using walkie-talkie radios… Read More »6 Reasons Why Today’s Physical Security Teams Can’t Rely on Walkie-Talkie Radios
The post 6 Reasons Why Today’s Physical Security Teams Can’t Rely on Walkie-Talkie Radios appeared first on Data Science Central.
( 19
min )
Announcements Achieving endpoint visibility to ward off the threat of a breach has never been more important than it is in the age of data proliferation and hybrid workplaces. Multiple endpoints and locations heighten that risk, making it essential for CISOs and IT security teams to overcome common challenges. Find out how organizations can reach… Read More »DSC Weekly 19 July 2022: From Knowledge Graphs to Transformation as a Service
The post DSC Weekly 19 July 2022: From Knowledge Graphs to Transformation as a Service appeared first on Data Science Central.
( 22
min )
AI and electric vehicle technology breakthroughs are transforming the automotive industry. These developments pave the way for new innovators, attracting technical prowess and design philosophies from Silicon Valley. Mike Bell, senior vice president of digital at Lucid Motors, sees continuous innovation coupled with over-the-air updates as key to designing sustainable, award-winning intelligent vehicles that provide Read article >
The post Lucid Motors’ Mike Bell on Software-Defined Innovation for the Luxury EV Brand appeared first on NVIDIA Blog.
( 4
min )
Methods that make a machine-learning model’s predictions more accurate overall can reduce accuracy for underrepresented subgroups. A new approach can help.
( 7
min )
When we work on a machine learning problem related to images, not only we need to collect some images as training data, but also need to employ augmentation to create variations in the image. It is especially true for more complex object recognition problems. There are many ways for image augmentation. You may use some […]
The post Image Augmentation with Keras Preprocessing Layers and tf.image appeared first on Machine Learning Mastery.
( 27
min )
submitted by /u/trcytony
[link] [comments]
( 86
min )
Hey guys! Let me share with you a cool piece that I came across in the latest IEEE newsletter issue. It’s a guide that covers a new approach to creating tinyML models. Hope you’ll find it useful: https://iot.ieee.org/newsletter/july-2022/automated-design-of-tiny-machine-learning-models-a-practical-guide-part-1
submitted by /u/Potsieramirez
[link] [comments]
( 92
min )
submitted by /u/jormungandrsjig
[link] [comments]
( 85
min )
submitted by /u/OnlyProggingForFun
[link] [comments]
( 86
min )
submitted by /u/Mobeamers
[link] [comments]
( 92
min )
submitted by /u/RubiksCodeNMZ
[link] [comments]
( 91
min )
submitted by /u/OneFinding1429
[link] [comments]
( 86
min )
Are you struggling to get clicks on your Google Ads? You’re not alone. In fact, most people don’t know how to write headlines that get clicked. If you’re ready to learn how to write headlines that get more clicks, then this blog post is for you. You’ll learn some great headline writing tips that will… Read More »Google Ads Headlines: How To Write Headlines That Get More Clicks
The post Google Ads Headlines: How To Write Headlines That Get More Clicks appeared first on Data Science Central.
( 20
min )
Over the last few years, online education platforms have seen an increase in adoption of and an uptick in demand for video-based learnings because it offers an effective medium to engage learners. To expand to international markets and address a culturally and linguistically diverse population, businesses are also looking at diversifying their learning offerings by […]
( 10
min )
Renewable resources like sunlight provide a sustainable and carbon neutral mechanism to generate power. Governments in many countries are providing incentives and subsidies to households to install solar panels as part of small-scale renewable energy schemes. This has created a huge demand for solar panels. Reaching out to potential customers at the right time, through […]
( 10
min )
submitted by /u/regalalgorithm
[link] [comments]
( 86
min )
submitted by /u/mmiller9913
[link] [comments]
( 86
min )
Intel has recently released Neural Compressor, an open-source Python package for model compression. This library can be applied to deep learning deployment on CPUs or GPUs to decrease the model size and speed up inference. Additionally, it offers a uniform user interface for well-known network compression techniques, including quantization, pruning, and knowledge distillation across various deep learning frameworks. The tool’s automatic accuracy-driven tweaking technique can be utilized to generate the best-quantized model. Additionally, it allows knowledge distillation so that the knowledge from the teacher model may be transferred to the student model. It implements several weight pruning methods to produce pruned models using a predetermined sparsity goal. For improved framework interoperability, the Python library also offers APIs for various deep learning frameworks, including TensorFlow, PyTorch, and MXNet.
Continue reading | The Github repo for the library can be accessed here.
submitted by /u/ai-lover
[link] [comments]
( 87
min )
submitted by /u/bendee983
[link] [comments]
( 86
min )
submitted by /u/prfitofthesngularity
[link] [comments]
( 86
min )
submitted by /u/PixelzJ
[link] [comments]
( 86
min )
submitted by /u/PixelzJ
[link] [comments]
( 86
min )
submitted by /u/iFighting
[link] [comments]
( 90
min )
📚 Release notes:
👉 https://github.com/kornia/kornia/releases/tag/v0.6.6
📚 Docs and tutorials
👉 https://kornia.readthedocs.io/en/latest/
https://preview.redd.it/fu46z17xtac91.png?width=1060&format=png&auto=webp&s=e10c42173fca97e76d9e2ccdea8809f112c4392b
https://preview.redd.it/xy64c27xtac91.png?width=640&format=png&auto=webp&s=789a197ab894ac0f5716e276aff360b09fdfb8eb
submitted by /u/edgarriba
[link] [comments]
( 88
min )
The Golden State Warriors won the 2022 National Basketball Association (NBA) title for many reasons. Having one of the top 10 players in NBA history in Steph Curry certainly helps. But other teams have top 10 / top 15 players, and they didn’t make the finals (or even get into the playoffs, in one case).… Read More »What Do NBA Champions and CDOs have in Common? Success Requires Being 2-way Players
The post What Do NBA Champions and CDOs have in Common? Success Requires Being 2-way Players appeared first on Data Science Central.
( 20
min )
Amazon Polly, an AI generated text-to-speech service, enables you to automate and scale your interactive voice solutions, helping to improve productivity and reduce costs. As our customers continue to use Amazon Polly for its rich set of features and ease of use, we have observed a demand for the ability to simultaneously generate synchronized audio […]
( 7
min )
Amazon Rekognition allows you to mitigate fraudulent attacks and minimize onboarding friction for legitimate customers through a streamlined identity verification process. This can result in an increase in customer trust and safety. Key capabilities of this solution include: Register a new user using a selfie Register a new user after face match against an ID […]
( 10
min )
Today, we are implementing a new technique so that DALL·E generates images of people that more accurately reflect the diversity of the world’s population. This technique is applied at the system level when DALL·E is given a prompt describing a person that does not
( 3
min )
NVIDIA Fleet Command — a cloud service for deploying, managing and scaling AI applications at the edge — now includes features that enhance the seamless management of edge AI deployments around the world. With the scale of edge AI deployments, organizations can have up to thousands of independent edge locations that must be managed by Read article >
The post Living on the Edge: New Features for NVIDIA Fleet Command Deliver All-in-One Edge AI Management, Maintenance for Enterprises appeared first on NVIDIA Blog.
( 6
min )
Technology company CORSAIR and streaming partner BigCheeseKIT step In the NVIDIA Studio this week. A leader in high-performance gear and systems for gamers, content creators and PC enthusiasts, CORSAIR has integrated NVIDIA Broadcast technologies into its hardware and iCUE software. Similar AI enhancements have also been added to Elgato’s audio and video software, Wave Link and Camera Hub.
The post CORSAIR Integrates NVIDIA Broadcast’s Audio, Video AI Features in iCUE and Elgato Software This Week ‘In the NVIDIA Studio’ appeared first on NVIDIA Blog.
( 7
min )
For many organizations, trusting their data to the cloud requires having a complete understanding of and control over the environment in which that data resides and how it’s being processed. Microsoft understands this, and we are committed to building a trustworthy cloud—one in which security, privacy, and transparency are built into its core. A key […]
The post Confidential Containers: Verifiably secure computation in the cloud appeared first on Microsoft Research.
( 9
min )
As the name suggests, synthetic data is the data that is artificially generated rather than being created by actual events. In marketing…
( 9
min )
submitted by /u/PixelzJ
[link] [comments]
( 86
min )
submitted by /u/PixelzJ
[link] [comments]
( 86
min )
submitted by /u/tohelpyou88
[link] [comments]
( 86
min )
submitted by /u/LordPewPew777
[link] [comments]
( 86
min )
submitted by /u/ezikler
[link] [comments]
( 86
min )
submitted by /u/Eth_ai
[link] [comments]
( 86
min )
1) I asked OpenAI what kind of web application I should make to help make data analysts more efficient.
It responded by telling me to build an app using NLP to provide people with Excel formulas based on a given prompt.
2) I told OpenAI the idea in a separate API request and asked it for an available domain name.
It gave me www.excelformulabot.com, which I built.
submitted by /u/dabressler
[link] [comments]
( 86
min )
submitted by /u/PixelzJ
[link] [comments]
( 86
min )
I have a dataset of tagged and linked object bounding-boxes in sequential video frames. If that isn't clear, you can watch a demo here:
https://www.youtube.com/watch?v=QKxSzFaHsbc
For various reasons, it's possible that a trajectory could be 'broken' in the dataset. Quick visual scanning doesn't allow detection of a break in a single trajectory; there are so many horizontal links, it's tough to notice one of them being missing.
How would you economically eliminate a small percentage of breaks in trajectories?
Some things I've thought of:
* Bootstrapping, i.e. using a trained network to predict -> this is a bit complex, it's possible but not my first choice
* Build a tool to view all linked detections overlaid in a single frame (doesn't immediately identify broken trajectories, but it might help)
Is there any simple UI I can build to easily identify broken trajectories in the dataset?
submitted by /u/asfarley--
[link] [comments]
( 90
min )
The Python package {copent} v0.3 now available on PyPI, with the new function 'mvnt' that implements the method for estimating the copula entropy-based statistic for multivariate normality test. See arXiv:2206.05956 for more details.
GITHUB: https://github.com/majianthu/pycopent
PyPI: https://pypi.org/project/copent/
Your comments are welcome.
submitted by /u/majianthu
[link] [comments]
( 87
min )
submitted by /u/SoyGambas
[link] [comments]
( 89
min )
submitted by /u/VIPTankz123
[link] [comments]
( 88
min )
Here is the list of all >1,200 ICML 2022 (International Conference on Machine Learning) papers, and a highlight for each of them. ICML 2022 will take place from July 17 at Baltimore.
https://www.paperdigest.org/2022/07/icml-2022-highlights/
submitted by /u/biandangou
[link] [comments]
( 87
min )
submitted by /u/getrich_or_diemining
[link] [comments]
( 86
min )
submitted by /u/BasicallyJustASpider
[link] [comments]
( 86
min )
submitted by /u/kwasi3114
[link] [comments]
( 87
min )
submitted by /u/PixelzJ
[link] [comments]
( 86
min )
submitted by /u/pixelz_ai
[link] [comments]
( 86
min )
submitted by /u/PixelzJ
[link] [comments]
( 86
min )
submitted by /u/PixelzJ
[link] [comments]
( 86
min )
submitted by /u/Zirius_Sadfaces
[link] [comments]
( 85
min )
submitted by /u/OnlyProggingForFun
[link] [comments]
( 86
min )
submitted by /u/PixelzJ
[link] [comments]
( 86
min )
submitted by /u/AttarWrites
[link] [comments]
( 86
min )
submitted by /u/markurtz
[link] [comments]
( 86
min )
submitted by /u/ranjeettechnincal
[link] [comments]
( 86
min )
submitted by /u/sopadebombillas
[link] [comments]
( 86
min )
submitted by /u/VIPTankz123
[link] [comments]
( 86
min )
submitted by /u/Available_Tadpole829
[link] [comments]
( 86
min )
submitted by /u/straylittlelambs
[link] [comments]
( 89
min )
submitted by /u/ExtensionVirtual471
[link] [comments]
( 86
min )
submitted by /u/keghn
[link] [comments]
( 86
min )
UC Berkeley and Google AI Researchers Introduce ‘Director’: a Reinforcement Learning Agent that Learns Hierarchical Behaviors from Pixels by Planning in the Latent Space of a Learned World Model. The world model Director builds from pixels allows effective planning in a latent space. To anticipate future model states given future actions, the world model first maps pictures to model states. Director optimizes two policies based on the model states’ anticipated trajectories: Every predetermined number of steps, the management selects a new objective, and the employee learns to accomplish the goals using simple activities. The direction would have a difficult control challenge if they had to choose plans directly in the high-dimensional continuous representation space of the world model. To reduce the size of the discrete codes created by the model states, they instead learn a goal autoencoder. The goal autoencoder then transforms the discrete codes into model states and passes them as goals to the worker after the manager has chosen them.
✅ Director agent learns practical, general, and interpretable hierarchical behaviors from raw pixels
✅ Director successfully learns in a wide range of traditional RL environments, including Atari, Control Suite, DMLab, and Crafter
✅ Director outperforms exploration methods on tasks with sparse rewards, including 3D maze traversal with a quadruped robot from an egocentric camera and proprioception
Continue reading| Checkout the paper and project
submitted by /u/ai-lover
[link] [comments]
( 87
min )
submitted by /u/VIPTankz123
[link] [comments]
( 87
min )
submitted by /u/Mediocre-Bullfrog686
[link] [comments]
( 87
min )
submitted by /u/markurtz
[link] [comments]
( 89
min )
I would like to share personal insights about doing great research and towards being a globally leading researcher:
Not all our research legacies are correct or will be corrected shortly, so just keep taking the initiative to correct them. https://openreview.net/forum?id=xENf4QUL4LW¬eId=C2eCHs2k6CM.
Not all our papers get cited or published, so when our papers serve as a great foundation for other works, just keep positive and confident to deliver them to more people who may be interested.
Reddit discussion
Linkedin discussion
submitted by /u/XinshaoWang
[link] [comments]
( 88
min )
submitted by /u/rubikvn2100
[link] [comments]
( 87
min )
CUHK released aDeepFashion-MultiModal dataset with rich multi-modal annotations, including manually annotated human parsing labels, manually annotated human keypoints, manually annotated fine-grained labels and textual descriptions in June 2022. Since then, researchers have been looking to work with the dataset, fine-tune it with CLIP model and different metrics.
While finetuning I understand is an imp. process and a difficult one, they claim to have gained 217% Delta increase on Recall metric. When I have been trying to run it, my laptop has not been so capable to run this, so I am looking for alternative for remote GPU.
But, is this growth of 217% from pertained to fine-tuned model even possible? A bit hard to believe. If so, is Colab a good option to run remote GPU while being able to make use of the functionality?
submitted by /u/jeoyous
[link] [comments]
( 88
min )
submitted by /u/Noniax
[link] [comments]
( 86
min )
submitted by /u/Shreya001
[link] [comments]
( 86
min )
submitted by /u/_ayushp_
[link] [comments]
( 86
min )
A detailed and insightful study by MetaAI team on the memorization, overfitting and forgetting in LLMs.
The paper talks about how different definitions of "memorization" and how scaling affects the amount of training data that the large language models can memorize during the training phase. Studies are also presented on how the forgetting curves look like and how overfitting relates to memorization for these large language models. The Appendix section is a gold mine as well.
Annotated version of the paper - Github Link
submitted by /u/shreyansh26
[link] [comments]
( 86
min )
submitted by /u/ExtensionVirtual471
[link] [comments]
( 86
min )
submitted by /u/greentea387
[link] [comments]
( 86
min )
submitted by /u/jormungandrsjig
[link] [comments]
( 86
min )
submitted by /u/Shreya001
[link] [comments]
( 86
min )
A detailed and insightful study by MetaAI team on the memorization, overfitting and forgetting in LLMs.
The paper talks about how different definitions of "memorization" and how scaling affects the amount of training data that the large language models can memorize during the training phase. Studies are also presented on how the forgetting curves look like and how overfitting relates to memorization for these large language models. The Appendix section is a gold mine as well.
Annotated version of the paper - Github Link
submitted by /u/shreyansh26
[link] [comments]
( 87
min )
Hi folks,
I was working on a personal experimental project, which I thought of making it open source now. It saves much time for literature research.
If you are an industrial researcher or in academia, you probably spend much time reading research articles and news related to your topic.
If you try to search papers related to your topic, finding relevant documents on the internet takes time. You probably know the pain of extracting citations of articles from different websites.
Previously I used to fetch papers from google or semantic scholar, but semantic scholar does not show correct paper citations.
I am excited to announce RESP: Research Papers Search
Features:
Fetch all citations of a single paper from Google Scholar in CSV format
Fetch all related papers of a single paper from Google Scholar in CSV format
Fetch all connected papers from connectedpapers.com (it does not use a citation tree, it uses similarity to build graphs) in CSV format
Fetch relevant papers based on keywords from different sources, including Arxiv, ACL, ACM, PMLR, NeurIPS, cvf etc., in CSV format
GITHUB: https://github.com/monk1337/resp
Examples: https://github.com/monk1337/resp/tree/main/examples
I hope it will be helpful in your research. Thanks :)
submitted by /u/aadityaura
[link] [comments]
( 89
min )
Today, social media is a huge source of news. Users rely on platforms like Facebook and Twitter to consume news. For certain industries such as insurance companies, first respondents, law enforcement, and government agencies, being able to quickly process news about relevant events occurring can help them take action while these events are still unfolding. […]
( 9
min )
Australian animator Marko Matosevic is taking jokes from a children’s school dads’ group and breathing them into animated life with NVIDIA Omniverse, a virtual world simulation and collaboration platform for 3D workflows.
The post Meet the Omnivore: Animator Entertains and Explains With NVIDIA Omniverse appeared first on NVIDIA Blog.
( 5
min )
AI Weirdness: the strange side of machine learning
( 2
min )
submitted by /u/rubikvn2100
[link] [comments]
( 86
min )
Image dump 1
submitted by /u/OneFinding1429
[link] [comments]
( 85
min )
submitted by /u/PerryJ
[link] [comments]
( 86
min )
submitted by /u/Brilliant_Scratch_63
[link] [comments]
( 86
min )
submitted by /u/RedRainHoloAI
[link] [comments]
( 86
min )
submitted by /u/getrich_or_diemining
[link] [comments]
( 86
min )
submitted by /u/akolonin
[link] [comments]
( 86
min )
submitted by /u/PolymorphismPrince
[link] [comments]
( 87
min )
This is a guest blog post by Danny Brock, Rajeev Govindan and Krishnaram Kenthapadi at Fiddler AI. Your Amazon SageMaker models are live. They’re handling millions of inferences each day and driving better business outcomes for your company. They’re performing exactly as well as the day they were launched. Er, wait. Are they? Maybe. Maybe […]
( 7
min )
Data scientists often work towards understanding the effects of various data preprocessing and feature engineering strategies in combination with different model architectures and hyperparameters. Doing so requires you to cover large parameter spaces iteratively, and it can be overwhelming to keep track of previously run configurations and results while keeping experiments reproducible. This post walks […]
( 13
min )
Organizations are increasingly building and using machine learning (ML)-powered solutions for a variety of use cases and problems, including predictive maintenance of machine parts, product recommendations based on customer preferences, credit profiling, content moderation, fraud detection, and more. In many of these scenarios, the effectiveness and benefits derived from these ML-powered solutions can be further […]
( 13
min )
Sponsored Post If you’re a data engineer or data scientist, you know how hard it is to generate and maintain realistic data at scale. And to guarantee data privacy protection, in addition to all your day-to-day responsibilities? OOF. Talk about a heavy lift. But in today’s world, efficient data de-identification is no longer optional for […]
The post High-Fidelity Synthetic Data for Data Engineers and Data Scientists Alike appeared first on Machine Learning Mastery.
( 10
min )
Check out our new open source code editor for transforming data and building ML pipelines: https://github.com/mage-ai/mage-ai
If you’re available, I’d love to hop on a quick Zoom to help you get set up.
In the meantime, here is the install guide: https://github.com/mage-ai/mage-ai#using-pip and a short tutorial: https://github.com/mage-ai/mage-ai/blob/master/docs/tutorials/train_titanic_model/README.md
I’d love to get your feedback on whether this is useful to you or not. Thank you so much!
submitted by /u/ollie_wollie_rocks
[link] [comments]
( 87
min )
As part of our DALL·E 2 research preview, more than 3,000 artists from more than 118 countries have incorporated DALL·E into their creative workflows. The artists in our early access group have helped us discover new uses for DALL·E and have served as
( 6
min )
submitted by /u/getrich_or_diemining
[link] [comments]
( 86
min )
Precision agriculture has recently shown a lot of interest in computer vision technology. Computer vision, at the heart of robotics and…
( 10
min )
Investigate the ultimate truth this GFN Thursday with Loopmancer, now streaming to all members on GeForce NOW. Stuck in a death loop, RTX 3080 and Priority members can search for the truth with RTX ON — including NVIDIA DLSS and ray-traced reflections. Plus, players can enjoy the latest Genshin Impact event with the “Summer Fantasia” Read article >
The post Action on Repeat: GFN Thursday Brings Loopmancer With RTX ON to the Cloud appeared first on NVIDIA Blog.
( 5
min )
Announcements Achieving endpoint visibility to ward off the threat of a breach has never been more important than it is in the age of data proliferation and hybrid workplaces. Multiple endpoints and locations heighten that risk, making it essential for CISOs and IT security teams to overcome common challenges. Find out how organizations can reach… Read More »DSC Weekly 12 July 2022: The Emergence of the Modern Studio Model
The post DSC Weekly 12 July 2022: The Emergence of the Modern Studio Model appeared first on Data Science Central.
( 22
min )
Twitter thread:
https://twitter.com/karpathy/status/1547332300186066944
submitted by /u/EffectSizeQueen
[link] [comments]
( 92
min )
It looks like chronic kidney disease diagnosis has been solved in this paper: https://ieeexplore.ieee.org/document/8693581
I mean no disrespect to the authors, but this publication makes me slightly doubt the peer-review system. Or I am just such an amateur, that I am not seeing the brilliance behind this paper, which is also possible.
Have a read through it yourselves
submitted by /u/fanconic
[link] [comments]
( 97
min )
SimSwap (https://github.com/neuralchen/SimSwap) is basically a framework that carries out face-swapping in a similar way deepfake technology does with a source and a target video. However, for the source, only one image is required. Not sure how this would work since 1 image isn't enough for actual training. Is this simply face mapping? I feel like the output is a bit too sophisticated for that.
submitted by /u/thr0away89
[link] [comments]
( 86
min )
submitted by /u/Available_Tadpole829
[link] [comments]
( 86
min )
submitted by /u/Repeat-or
[link] [comments]
( 86
min )
submitted by /u/LordPewPew777
[link] [comments]
( 86
min )
submitted by /u/much_successes
[link] [comments]
( 85
min )
submitted by /u/Mrhelloistaken
[link] [comments]
( 85
min )
submitted by /u/OnlyProggingForFun
[link] [comments]
( 86
min )
submitted by /u/chelsea_bear
[link] [comments]
( 86
min )
submitted by /u/Maruf2014
[link] [comments]
( 86
min )
submitted by /u/Lakshmireddys
[link] [comments]
( 84
min )
In this seminar Aditya introduces a framework that abstracts Reinforcement Learning (RL) as a sequence modeling problem. Watch on YouTube.
submitted by /u/Stanford_Online
[link] [comments]
( 86
min )
Artificial intelligence (AI) has become synonymous with assistance and efficiency. From a technology that was looked at with mistrust as…
( 10
min )
submitted by /u/vwxyzjn
[link] [comments]
( 84
min )
https://preview.redd.it/xjtcha3r35b91.png?width=1298&format=png&auto=webp&s=00873223c1ea0c6afcd5e22c7645521036b7e341
This post presents a way to run transformers models via the Python C API. The referenced notebook loads two txtai workflows, one that translates English to French and another that summarizes a webpage. After loading the models through C code, another example runs the workflows through assembly to show this works with any native code.
Full code links: Notebook | GitHub
submitted by /u/davidmezzetti
[link] [comments]
( 86
min )
submitted by /u/pixelz_ai
[link] [comments]
( 84
min )
submitted by /u/NarcoticSlug
[link] [comments]
( 84
min )
submitted by /u/Sollimann
[link] [comments]
( 84
min )
submitted by /u/kbf_
[link] [comments]
( 84
min )
submitted by /u/joemurray1994
[link] [comments]
( 84
min )
BigScience Project introduces BLOOM (BigScience Large Open-science Open-access Multilingual Language Model), the first multilingual Large Language Model (LLM) trained in complete transparency by the largest group of AI academics. Unlike the traditional secrecy of industrial AI research laboratories, the project demonstrates the possibility of training promising AI models published by the larger research community responsibly and openly.
✅ Transformers-based LLM
✅ 176B parameters (larger than GPT-3 and OPT-175B)
✅ Trained on 1.6TB text data, the equivalent of 320 times the complete works of Shakespeare
Continue reading | Download
submitted by /u/ai-lover
[link] [comments]
( 84
min )
submitted by /u/nalr00n
[link] [comments]
( 84
min )
submitted by /u/Racer_x32
[link] [comments]
( 86
min )
submitted by /u/Gloomy_Recognition_4
[link] [comments]
( 86
min )
submitted by /u/KazRainer
[link] [comments]
( 84
min )
submitted by /u/MrDemonFrog
[link] [comments]
( 84
min )
submitted by /u/Available_Tadpole829
[link] [comments]
( 84
min )
Touring vehicles just became a little more grand. Electric vehicle maker Human Horizons provided a detailed glimpse earlier this month of its latest production model, the GT HiPhi Z. The intelligent EV is poised to redefine the grand tourer category with innovative, software-defined capabilities that bring luxurious cruising to the next level. The vehicle’s marquee Read article >
The post Grand Entrance: Human Horizons Unveils Smart GT Built on NVIDIA DRIVE Orin appeared first on NVIDIA Blog.
( 5
min )
Kristel Michielsen was into quantum computing before quantum computing was cool. The computational physicist simulated quantum computers as part of her Ph.D. work in the Netherlands in the early 1990s. Today, she manages one of Europe’s largest facilities for quantum computing, the Jülich Unified Infrastructure for Quantum Computing (JUNIQ) . Her mission is to help Read article >
The post Merge Ahead: Researcher Takes Software Bridge to Quantum Computing appeared first on NVIDIA Blog.
( 6
min )
Visual effects savant Surfaced Studio steps In the NVIDIA Studio this week to share his clever film sequences, Fluid Simulation and Destruction, as well as his creative workflows. These sequences feature quirky visual effects that Surfaced Studio is renowned for demonstrating on his YouTube channel.
The post Sequences That Stun: Visual Effects Artist Surfaced Studio Arrives ‘In the NVIDIA Studio’ appeared first on NVIDIA Blog.
( 6
min )
A geometric deep-learning model is faster and more accurate than state-of-the-art computational models, reducing the chances and costs of drug trial failures.
( 6
min )
Web analytics tools offer vital insights into your website’s visitors’ behavior by tracking their real-time activities on the platform from behind. These tools study almost everything – the number of daily and regular visitors, sessions and duration, conversions, and beyond. You can access a comprehensive report covering every aspect and personalize it to focus on… Read More »Web Analytics Dashboards Carry a World of Data for Various Purposes
The post Web Analytics Dashboards Carry a World of Data for Various Purposes appeared first on Data Science Central.
( 18
min )
Convolutional neural networks have been found successful in computer vision applications. Various network architectures are proposed and they are neither magical nor hard to understand. In this tutorial, we will make sense of the operation of convolutional layers and their role in a larger convolutional neural network. After finishing this tutorial, you will learn: How […]
The post Understanding the Design of a Convolutional Neural Network appeared first on Machine Learning Mastery.
( 14
min )
Scrapy is highly customizable and developer friendly crawling framework in Python. It can help you build in few line wonderful crawler to…
( 11
min )
submitted by /u/Vasilkosturski
[link] [comments]
( 84
min )
A re-implementation of the famous 2020 paper - "Extracting Training Data from Large Language Models" by Nicholas Carlini, Florian Tramer et al.
Code - https://github.com/shreyansh26/Extracting-Training-Data-from-Large-Langauge-Models
The official implementation is great and I definitely learned a few things from it. In the re-implementation, I have also included the temperature-decay sampling and sliding-window-based minimum perplexity metric which was not present in the official implementation.
I checked the extracted Samples (refer to the Github repo) and they surely contained some memorized information.
submitted by /u/shreyansh26
[link] [comments]
( 85
min )
An awesome collection of Federated learning & Blockchain research papers in the Healthcare domain.
Federated learning, a mechanism of training a shared global model with a central server while keeping all the sensitive data in local institutions where the data belong, provides great promise to connect the fragmented healthcare data sources with privacy preservation. This repo contains a curated list of Federated Learning papers/resources and recent advancements in Healthcare.
As of now ~330 papers
Pr's welcome
https://github.com/monk1337/Aweome-Heathcare-Federated-Learning
submitted by /u/aadityaura
[link] [comments]
( 85
min )
submitted by /u/7NoteDancing
[link] [comments]
( 85
min )
submitted by /u/GetFlappy
[link] [comments]
( 84
min )
submitted by /u/mdfnb
[link] [comments]
( 84
min )
submitted by /u/deephugs
[link] [comments]
( 83
min )
submitted by /u/trcytony
[link] [comments]
( 84
min )
submitted by /u/biggbrother23
[link] [comments]
( 83
min )
submitted by /u/getrich_or_diemining
[link] [comments]
( 84
min )
submitted by /u/bendee983
[link] [comments]
( 84
min )
submitted by /u/oliviagolds
[link] [comments]
( 85
min )
submitted by /u/GroundbreakingLaw878
[link] [comments]
( 83
min )
submitted by /u/jormungandrsjig
[link] [comments]
( 85
min )
submitted by /u/GroundbreakingLaw878
[link] [comments]
( 84
min )
The unveiling by U.S. President Joe Biden Monday of the first full-color image from the James Webb Space Telescope is already astounding — and delighting — humans around the globe. “We can see possibilities nobody has ever seen before, we can go places nobody has ever gone before,” Biden said during a White House press Read article >
The post AI on the Sky: Stunning New Images From the James Webb Space Telescope To Be Analyzed by, Train, AI appeared first on NVIDIA Blog.
( 5
min )
Engineers are using the NVIDIA Omniverse 3D simulation platform as part of a proof of concept that promises to become a model for putting green energy to work around the world. Dubbed Gigastack, the pilot project — led by a consortium that includes Phillips 66 and Denmark-based renewable energy company Ørsted — will create low-emission Read article >
The post Windfall: Omniverse Accelerates Turning Wind Power Into Clean Hydrogen Fuel appeared first on NVIDIA Blog.
( 6
min )
LiDAR is a key enabling technology in growing autonomous markets, such as robotics, industrial, infrastructure, and automotive. LiDAR delivers precise 3D data about its environment in real time to provide “vision” for autonomous solutions. For autonomous vehicles (AVs), nearly every carmaker uses LiDAR to augment camera and radar systems for a comprehensive perception stack capable […]
( 13
min )
We live in a data-rich world. Very data rich. Indeed, it’s estimated that roughly 2.5 quintillion bytes of data are created every day. Perhaps because of its ubiquity, there are those who believe the sheer volume of available data means we have all we need to easily and accurately answer any question without delay. If… Read More »Why We Need to Move From Data-First to a Knowledge-First World
The post Why We Need to Move From Data-First to a Knowledge-First World appeared first on Data Science Central.
( 19
min )
The tech industry is abuzz with hyped up pontifications and bold predictions of the business-changing potential of Data Products. I could not be happier as it’s a topic I have explored in several blogs (see the end of this blog for a list of my blogs on Data Products…yea, I know, get a life). A… Read More »Critical Role of Analytic Profiles in Developing Data Products
The post Critical Role of Analytic Profiles in Developing Data Products appeared first on Data Science Central.
( 20
min )
According to the McKinsey Report called Value Creation in the Metaverse: $120b+ in investment has flowed into the metaverse so far in 2022 79% of consumers active on the metaverse have made a purchase >15% of corporate revenue is expected to come from the metaverse in the next 5 years according to 25% of senior… Read More »Metaverse use cases – Which industries could the metaverse impact?
The post Metaverse use cases – Which industries could the metaverse impact? appeared first on Data Science Central.
( 19
min )
Executing Industrial Internet of Things (IIoT) solutions is vital as the most competitive global manufacturing companies are becoming digital enterprises. Industrial Internet of Things (IIoT) solutions and platforms are leading the reshaping and transformation of landscapes. A pre-built Industrial Internet of Things (IIoT) solution offers the benefit of a ready-made “IoT development kit” with the… Read More »Features of IIoT (Industrial Internet of Things) Seamless Connectivity and Data Acquisition
The post Features of IIoT (Industrial Internet of Things) Seamless Connectivity and Data Acquisition appeared first on Data Science Central.
( 18
min )
On Wednesday, July 13th at 11 am EST, please join DQLabs for an exclusive virtual event“Defining Data Relevance: The rise of the Modern Data Stack and the Modern Data Quality Platform”. The data producers, consumers, and leaders deserve an ecosystem that delivers the data that is relevant to them – one size fits all approaches… Read More »Webinar Series -The rise of the Modern DataStack and the Modern Data Quality Platform
The post Webinar Series -The rise of the Modern DataStack and the Modern Data Quality Platform appeared first on Data Science Central.
( 17
min )
IT leaders are running into several RPA failures. Here, we have covered the top 7 reasons why RPA implementations fail and how you can…
( 12
min )
Crawling a website is as today an essential skill for anyone working in or with the digital industry. Firstly, I will start by clarifying…
( 10
min )
Google Imagen: a machine learning system that can generate graphics from text input.
( 6
min )
submitted by /u/tohelpyou88
[link] [comments]
( 84
min )
https://reddit.com/link/vw3tkf/video/xe0t4pumpta91/player
https://reddit.com/link/vw3tkf/video/7pf9dl3npta91/player
Programming
Function from Description
Code to Explanation
Fix invalid Code
Translate Languages
Class from Description
Get Language from Code
Function from Docstring
Helpers
Regex from Description
Regex to Explanation
Linux Command
Get time complexity
Git Command from Description
Database
Text Description to SQL Command
Web
Generate HTML from Description
CSS from Description
Meta Tags from Description
I think this could be helpful to a lot of people (especially for beginner programmers). You can check out all functionalities on your own here:
programming-helper.com
Have fun using the tool ❤️
submitted by /u/Capital_Revolution35
[link] [comments]
( 86
min )
submitted by /u/SpatialComputing
[link] [comments]
( 86
min )
Hello, the title says it all. I'm trying to find any ressources (mainly aligned corpus) that could be helpful in identifying and simplifying complex sentences in French. ALECTOR is the only one I stumbled upon.
Do you have any resources or tips? I was wondering if searching for book and their simplified version could be useful but I fear it would be more like learning to translate old french into modern french.
submitted by /u/Sacrezar
[link] [comments]
( 85
min )
submitted by /u/Albertrech
[link] [comments]
( 86
min )
submitted by /u/LordPewPew777
[link] [comments]
( 84
min )
submitted by /u/Available_Tadpole829
[link] [comments]
( 84
min )
submitted by /u/Gereshes
[link] [comments]
( 84
min )
In cooperative multi-agent reinforcement learning (MARL), due to its on-policy nature, policy gradient (PG) methods are typically believed to be less sample efficient than value decomposition (VD) methods, which are off-policy. However, some recent empirical studies demonstrate that with proper input representation and hyper-parameter tuning, multi-agent PG can achieve surprisingly strong performance compared to off-policy VD methods.
Why could PG methods work so well? In this post, we will present concrete analysis to show that in certain scenarios, e.g., environments with a highly multi-modal reward landscape, VD can be problematic and lead to undesired outcomes. By contrast, PG methods with individual policies can converge to an optimal policy in these cases. In addition, PG methods wit…
( 5
min )
submitted by /u/PixelzJ
[link] [comments]
( 84
min )
submitted by /u/jormungandrsjig
[link] [comments]
( 84
min )
submitted by /u/Available_Tadpole829
[link] [comments]
( 84
min )
submitted by /u/rikusorasephiroth
[link] [comments]
( 83
min )
submitted by /u/laul_pogan
[link] [comments]
( 83
min )
For several years, the Stratego board game has been regarded as one of the most promising areas of research in Artificial Intelligence. Stratego is a two-player board game in which each player attempts to take the other player’s flag. There are two main challenges in the game. 1) There are 10535 potential states in the Stratego game tree. 2) Each player in this game must consider 1066 possible deployments at the beginning of the game. Due to the various complex components of the game’s structure, the AI research community has made minimal progress in this area.
This research introduces DeepNash, an autonomous agent that can develop human-level expertise in the imperfect information game Stratego from scratch. Regularized Nash Dynamics (R-NaD), a principled, model-free reinforcement learning technique, is the prime backbone of DeepNash. DeepNash achieves an ε-Nash equilibrium by integrating R-NaD with deep neural network architecture. A Nash equilibrium ensures that the agent will perform well even when faced with the worst-case scenario opponent. The stratego game and a description of the DeepNash technique are shown in Figure 1.
Continue reading | Checkout the paper
submitted by /u/ai-lover
[link] [comments]
( 85
min )
submitted by /u/norcalnatv
[link] [comments]
( 86
min )
submitted by /u/Phat_N_Sassy33
[link] [comments]
( 84
min )
submitted by /u/henlo_there_fren
[link] [comments]
( 84
min )
submitted by /u/estasfuera
[link] [comments]
( 83
min )
In this research article, the researchers from UC Berkeley demonstrated that extracting from a sizable news corpus may effectively train language models on prior predicting problems.
Forecasting is a process that makes educated projections using previous data as inputs when identifying the direction of future trends. Forecasting future events in the real world, including pandemics, the economy, or the environment, is still complex but essential. Because dynamic information processing is a crucial component of efficient forecasting, AI researchers are considering using strong large-scale language models to automate these processes.
Researchers present a dataset with tens of thousands of forecasting questions and a date-based news corpus in the new paper Forecasting Future World Events with Neural Networks. They also curate IntervalQA, a dataset with numerical questions and metrics for calibration.
Continue reading | Checkout the paper and github
submitted by /u/ai-lover
[link] [comments]
( 85
min )
submitted by /u/Available_Tadpole829
[link] [comments]
( 84
min )
submitted by /u/OnlyProggingForFun
[link] [comments]
( 84
min )
Training a neural network or large deep learning model is a difficult optimization task. The classical algorithm to train neural networks is called stochastic gradient descent. It has been well established that you can achieve increased performance and faster training on some problems by using a learning rate that changes during training. In this post […]
The post Using Learning Rate Schedules for Deep Learning Models in Python with Keras appeared first on Machine Learning Mastery.
( 24
min )
submitted by /u/gwern
[link] [comments]
( 85
min )
A one-of-a-kind electric race car revved to life before it was manufactured — or even prototyped — thanks to GPU-powered extended reality technology. At the Automotive Innovation Forum in May, NVIDIA worked with Autodesk VRED to showcase a photorealistic Porsche electric sports car in augmented reality, with multiple attendees collaborating in the same immersive environment. Read article >
The post No Fueling Around: Designers Collaborate in Extended Reality on Porsche Electric Race Car appeared first on NVIDIA Blog.
( 5
min )
What is document management?
( 8
min )
The role of AI in the Metaverse has yet to be established. Is AI and blockchain technology a good fit?
( 9
min )
Optical character recognition (OCR) is the task of converting printed or handwritten text into machine-encoded text. OCR has been widely used in various scenarios, such as document electronization and identity authentication. Because OCR can greatly reduce the manual effort to register key information and serve as an entry step for understanding large volumes of documents, […]
( 11
min )
https://youtu.be/jSdHmImyUjk
Yann LeCun's position paper on a path towards machine intelligence combines Self-Supervised Learning, Energy-Based Models, and hierarchical predictive embedding models to arrive at a system that can teach itself to learn useful abstractions at multiple levels and use that as a world model to plan ahead in time.
OUTLINE:
0:00 - Introduction
2:00 - Main Contributions
5:45 - Mode 1 and Mode 2 actors
15:40 - Self-Supervised Learning and Energy-Based Models
20:15 - Introducing latent variables
25:00 - The problem of collapse
29:50 - Contrastive vs regularized methods
36:00 - The JEPA architecture
47:00 - Hierarchical JEPA (H-JEPA)
53:00 - Broader relevance
56:00 - Summary & Comments
Paper: https://openreview.net/forum?id=BZ5a1r-kVsf
submitted by /u/ykilcher
[link] [comments]
( 86
min )
I recently noticed a Weibo (Chinese Twitter) thread of an alarming potential academic misconduct - Prof. Yisen Wang’s girlfriend accused him of cheating and collusion behaviors in recent top-tier machine learning conferences, including but may not limit to NeurIPS2021 and ICML2021. Yisen Wang (homepage: https://yisenwang.github.io/) obtained his Ph.D. degree at Tsinghua University (China) and is now an assistant professor at Peking University (China). Yisen is interested in adversarial attack, etc.
Here are some facts from Yisen’s girlfriend’s post:
[Cheating in best paper nomination in ICML 2021] In ICML2021, Yisen asked one area chair of ICML2021 to recommend his first PhD student Jingyi Cui’s paper to be best paper candidate(I am not sure if it is termed as “best paper candidate”, …
( 93
min )
submitted by /u/ezikler
[link] [comments]
( 84
min )
Almost all industries are now using machine learning systems to improve the efficiency and dependability of their work. With the increasing use of ML, companies have seen a boom in the investments in the resources needed to support ML systems. Additionally, a single ML process necessitates the execution of numerous distinct models, further complicating the process and increasing costs.
The idea of “Unified Models” was established in recent years, where a single model is constructed to power a process or product rather than a collection of connected but independent models. Combining all of the necessary data into one array and passing it to the model makes it possible to create a unified model that delivers all of the findings at once rather than by calling individual models one at a time.
Continue reading | Check out the demo
submitted by /u/ai-lover
[link] [comments]
( 85
min )
submitted by /u/LordPewPew777
[link] [comments]
( 84
min )
submitted by /u/tohelpyou88
[link] [comments]
( 84
min )
submitted by /u/estasfuera
[link] [comments]
( 85
min )
In short: Data Engineers are still the most sought-after professionals in the field (more engineering, less "modeling"?), demand for analysts and leadership (!) roles is on the rise.
Full insights here: https://insights.ai-jobs.net/the-10-most-in-demand-jobs-in-ai-ml-and-big-data-in-2022/
submitted by /u/ai_jobs
[link] [comments]
( 84
min )
submitted by /u/Available_Tadpole829
[link] [comments]
( 84
min )
Let’s say you have identified a use case in your organization that you would like to handle via a chatbot. You familiarized yourself with Amazon Lex, built a prototype, and did a few trial interactions with the bot. You liked the overall experience and now want to deploy the bot in your production environment, but […]
( 7
min )
Machine learning (ML) is disrupting a lot of industries at an unprecedented pace. The healthcare and life sciences (HCLS) industry has been going through a rapid evolution in recent years embracing ML across a multitude of use cases for delivering quality care and improving patient outcomes. In a typical ML lifecycle, data engineers and scientists […]
( 17
min )
NVIDIA’s latest corporate responsibility report shares our efforts in empowering employees and putting to work our technologies for the benefit of humanity. Amid ongoing global economic concerns and pandemic challenges, this year’s report highlights our ability to attract and retain talent that come here to do their life’s work while tackling some of the world’s Read article >
The post Mission-Driven: Takeaways From Our Corporate Responsibility Report appeared first on NVIDIA Blog.
( 7
min )
Nothing beats the summer heat like GFN Thursday. Get ready for four new titles streaming at GeForce quality across nearly any device. Buckle up for some great gaming, whether poolside, in the car for a long road trip, or in the air-conditioned comfort of home. Speaking of summer, it’s also last call for this year’s Read article >
The post GFN Thursday Brings New Games to GeForce NOW for the Perfect Summer Playlist appeared first on NVIDIA Blog.
( 5
min )
Want to learn about AI and machine learning? There are plenty of resources out there to help — blogs, podcasts, YouTube tutorials — perhaps too many. Machine learning engineer Santiago Valdarrama has taken a far more focused approach to helping us all get smarter about the field. He’s created a following by posing one machine Read article >
The post Wordle for AI: Santiago Valderrama on Getting Smarter on Machine Learning appeared first on NVIDIA Blog.
( 5
min )
submitted by /u/tohelpyou88
[link] [comments]
( 84
min )
One the biggest stories of the year in the AI community is about a Google engineer’s claim of sentient AI. This was part of Google’s LaMDA…
( 21
min )
Artificial intelligences as our allies
( 8
min )
Over the coming decade, deep learning looks set to have a transformational impact on the natural sciences. The consequences are potentially far-reaching and could dramatically improve our ability to model and predict natural phenomena over widely varying scales of space and time. Could this capability represent the dawn of a new paradigm of scientific discovery? […]
The post AI4Science to empower the fifth paradigm of scientific discovery appeared first on Microsoft Research.
( 10
min )
Researchers develop a comfortable, form-fitting fabric that recognizes its wearer’s activities, like walking, running, and jumping.
( 8
min )
submitted by /u/Lakshmireddys
[link] [comments]
( 83
min )
Tried to explain as much as possible in the title. I did a "run" of DALL-E and I have already used photoshop's macros to crop each of them in a different file bc I feel like there's an interesting experience in watching it go through similar but different iteractions, but I would like it to be sorted by similarity to make the most impact. Can any of you recommend me a way to do that? The first result I found in google pinged the antivirus so I felt like getting recommendations was the way to go.
Here's an example of that kind of images I'm talking about https://imgur.com/a/miG2WWZ
submitted by /u/quiteawhile
[link] [comments]
( 84
min )
submitted by /u/Farnectarine4825
[link] [comments]
( 84
min )
submitted by /u/LordPewPew777
[link] [comments]
( 84
min )
submitted by /u/much_successes
[link] [comments]
( 85
min )
submitted by /u/nalr00n
[link] [comments]
( 84
min )
submitted by /u/OnlyProggingForFun
[link] [comments]
( 84
min )
submitted by /u/Lakshmireddys
[link] [comments]
( 84
min )
View the tutorial here: HERE
This tutorial teaches you how to transfer the style of one image to another image using neural-style-pt.
Below is a imgur gallery showing off the transformation process.
https://imgur.com/gallery/iMlkkQi
Let me know if you have any questions or comments.
submitted by /u/mshriver2
[link] [comments]
( 84
min )
Imagine a surgeon taking video calls with patients across the globe without the need of a human translator. What if a fledgling startup could easily expand their product across borders and into new geographical markets by offering fluid, accurate, multilingual customer support and sales, all without the need of a live human translator? What happens […]
( 10
min )
As the title suggests, I would like to know the correct way to pre-process the cityscapes dataset for object detection. There are multiple ways how this can be done. There is a version in Detectron2, in MM Detection, there is this. Which one is the correct way, without getting errors in the labels? Anybody worked with this before? Would be glad if anybody might have an idea.
submitted by /u/SeucheAchat9115
[link] [comments]
( 86
min )
Machine learning solutions are already embedded in the finance and banking industry. In this article, we reviewed the most popular use cases of ML in banking and shared practical tips on how to implement it into your business.https://exadel.com/news/how-machine-learning-is-used-in-finance-and-banking
submitted by /u/lklimusheuskaja
[link] [comments]
( 85
min )
Per his tweet at https://twitter.com/goodfellow_ian/status/1544638709039091717, Goodfellow will be a research scientist under Oriol Vinyals' Deep Learning team.
submitted by /u/The_Removed
[link] [comments]
( 90
min )
Hi all,
I've been an active practitioner in Deep Learning and then wanted to build something in MLOps.
So wanted to dig deeper in how DevOps evolved and wanted to check if MLOps can take the same path.
The findings are really great. Absolutely every tool doing well in the market is a clear replacement for DevOps tool in MLOps.
Here is my blog on it. Looking for feedback. If you have any comments, let me know. Will add them.
https://sachinchandra.substack.com/p/bringing-software-development-principles
submitted by /u/scb_11
[link] [comments]
( 89
min )
submitted by /u/Euphetar
[link] [comments]
( 84
min )
AI has brought a new life to art.
( 7
min )
Piction Health, founded by Susan Conover SM ’15, uses machine learning to help physicians identify and manage skin disease.
( 8
min )
An interesting article in the Systematic Biology journal about identifying insects: https://academic.oup.com/sysbio/article/68/6/876/5368535
See as well: Deep learning and computer vision will transform entomology
submitted by /u/1_like_science
[link] [comments]
( 85
min )
Prompted by a recent discussion on social media, I did some benchmarks and wrote down my thoughts on why it doesn't really make a difference whether we choose batch sizes as powers of 2: https://sebastianraschka.com/blog/2022/batch-size-2.html
What is your experience, do you
do you stick to batch sizes as powers of 2 or do you choose batch sizes more freely?
notice a substantial difference when you choose batch sizes as powers of 2 (or multiples of 8)?
submitted by /u/seraschka
[link] [comments]
( 92
min )
submitted by /u/cganimater
[link] [comments]
( 83
min )
submitted by /u/LordPewPew777
[link] [comments]
( 84
min )
submitted by /u/bendee983
[link] [comments]
( 83
min )
submitted by /u/Zirius_Sadfaces
[link] [comments]
( 84
min )
submitted by /u/DavidKShapiro
[link] [comments]
( 84
min )
submitted by /u/rikusorasephiroth
[link] [comments]
( 83
min )
submitted by /u/muditjps
[link] [comments]
( 84
min )
Keras is a Python library for deep learning that wraps the efficient numerical libraries TensorFlow and Theano. Keras allows you to quickly and simply design and train neural network and deep learning models. In this post you will discover how to effectively use the Keras library in your machine learning project by working through a […]
The post Binary Classification Tutorial with the Keras Deep Learning Library appeared first on Machine Learning Mastery.
( 54
min )
A simple and powerful regularization technique for neural networks and deep learning models is dropout. In this post you will discover the dropout regularization technique and how to apply it to your models in Python with Keras. After reading this post you will know: How the dropout regularization technique works. How to use dropout on […]
The post Dropout Regularization in Deep Learning Models With Keras appeared first on Machine Learning Mastery.
( 37
min )
If you use the default lifecycle configuration for your domain or user profile in Amazon SageMaker Studio and use Amazon SageMaker Data Wrangler for data preparation, then this post is for you. In this post, we show how you can create a Data Wrangler flow and use it for data preparation in a Studio environment […]
( 8
min )
Socially disadvantaged communities have often raised legitimate concerns about being over-policed and under-protected. Now, the rise of AI…
( 12
min )
Since a customized AI solution is always individual, no one can give you a general cost estimate.
( 8
min )
Putting art, mathematics and computers together in the mid-1980s created a new genre of digital media: fractal art. In the NVIDIA Studio this week, computer graphics (CG) artist, educator and curator Xueguo Yang shares his insights behind fractal art — which uses algorithms to artistically represent calculations derived from geometric objects as digital images and animations.
The post Computer Graphics Artist Xueguo Yang Shares Fractal Art Series This Week ‘In the NVIDIA Studio’ appeared first on NVIDIA Blog.
( 7
min )
submitted by /u/pixelz_ai
[link] [comments]
( 82
min )
The study on explainability or explainable AI is currently receiving a lot of attention as DNNs become accessible in a variety of application domains. Many explainability techniques that attempt to provide the local explanation of the DNNs prediction for a particular instance, such as techniques that provide saliency maps for understanding which sub-parts in an instance are most responsible for the model prediction, have been proposed in an effort to open the black box of DNNs.
While local explanation techniques have seen a rapid growth in research in recent years, the majority of attention has been placed on handling the generation of explanations rather than understanding whether the explanations are accurate or reasonable, what to do if they are, and how to modify the model to produce more accurate or reasonable explanations.
Continue reading | Checkout the paper and github
submitted by /u/ai-lover
[link] [comments]
( 84
min )
Artificially intelligent models have recently advanced to the point that users will soon be able to utilize these models to immediately construct and alter nearly photorealistic three-dimensional sceneries from the comfort of their laptops. Since these technologies make it simple to generate hyperrealistic avatars, they will revolutionize the way artists working on video games and CGI for movies approach their work. For quite some time, AIs have been able to create realistic 2D images. However, 3D scenarios have proven to be more challenging due to the enormous computer power needed. The AI model EG3D, created by a team of Stanford academics, can be used to produce random high-resolution images of faces and other things having an underlying geometric structure. This model is one of the first 3D models now in use to reach rendering quality close to photorealism.
Continue reading | Checkout the paper, github
submitted by /u/ai-lover
[link] [comments]
( 84
min )
submitted by /u/DragonGod2718
[link] [comments]
( 83
min )
submitted by /u/Available_Tadpole829
[link] [comments]
( 83
min )
submitted by /u/OmitsWordsByAccident
[link] [comments]
( 83
min )
submitted by /u/RubiksCodeNMZ
[link] [comments]
( 83
min )
https://codingvidya.com/best-artificial-intelligence-courses-for-healthcare/
submitted by /u/Lakshmireddys
[link] [comments]
( 82
min )
A blog about representation learning from masked images, what makes a good mask, and how to learn such masks: https://akosiorek.github.io/ml/2022/07/04/masking_repr_learning_vision.html.
Based on a recent ICML paper: Shi et. al, "Adversarial Masking for Self-Supervised Learning", ICML 2022.
submitted by /u/ErrorDry4380
[link] [comments]
( 84
min )
submitted by /u/gwern
[link] [comments]
( 83
min )
submitted by /u/estasfuera
[link] [comments]
( 82
min )
submitted by /u/Available_Tadpole829
[link] [comments]
( 83
min )
submitted by /u/SlightSituation
[link] [comments]
( 83
min )
submitted by /u/SlightSituation
[link] [comments]
( 83
min )
submitted by /u/bartturner
[link] [comments]
( 83
min )
Hi all. Created a basic guide on generating AI art using VQGAN+CLIP. This is for biginners:
VQGAN - A step-by-step guide
submitted by /u/Laks_Abey
[link] [comments]
( 82
min )
submitted by /u/glenniszen
[link] [comments]
( 82
min )
submitted by /u/prfitofthesngularity
[link] [comments]
( 82
min )
submitted by /u/Snoo63916
[link] [comments]
( 84
min )
submitted by /u/zicxor
[link] [comments]
( 84
min )
submitted by /u/keghn
[link] [comments]
( 82
min )
submitted by /u/tohelpyou88
[link] [comments]
( 83
min )
submitted by /u/PixelzJ
[link] [comments]
( 82
min )
submitted by /u/nalr00n
[link] [comments]
( 83
min )
Active research projects frequently devolve into a jumble of files with varying degrees of descriptive names processed by Python programs and bound together by Bash scripts. People can never be entirely sure that they can actually repeat a result since intermediate outcomes disappear or become difficult to locate.
Tango ensures you never operate on outdated data by taking care of your intermediate and final outcomes and finding them again when needed.
What does that actually mean?
Tango has a lot of capabilities, but its main feature is this:
Tango caches function results even if your process is restarted. If one merely takes advantage of one function, Tango can significantly benefit you.
Continue reading | Github
submitted by /u/ai-lover
[link] [comments]
( 83
min )
submitted by /u/Historical-Object374
[link] [comments]
( 83
min )
submitted by /u/t-bands
[link] [comments]
( 84
min )
submitted by /u/Illustrious_Row_9971
[link] [comments]
( 84
min )
submitted by /u/cloud_weather
[link] [comments]
( 84
min )
submitted by /u/surelyouarejoking
[link] [comments]
( 88
min )
I want to share the PyTorch implementation of "An Improved One millisecond Mobile Backbone" paper.
Unfortunately, I don't have the appropriate computational resources to train the models on ImageNet, so feel free to use my implementation for that purpose.
Hope you all find it useful, feedback would be appreciated.
Repository: https://github.com/federicopozzi33/MobileOne-PyTorch
Paper: https://arxiv.org/abs/2206.04040
submitted by /u/FedEx33
[link] [comments]
( 84
min )
submitted by /u/Snoo63916
[link] [comments]
( 84
min )
Hey, I wanted to share my recent ML project: LCPN-hiernet.
LCPN-hiernet is a hierarchical image classification model for e-commerce items based on EfficientNet-b4 and LCPN (Local Classifier per Parent Node) technique.
LCPN technique is training one multi-class classifier for each parent node, to distinguish between its child nodes. In my example of classifying fashion products, that would mean one classifier on the first level (to determine “bags”, “clothes” or “accessories”), then three more classifiers to determine the specific model.
I’m sure there are a lot of places to improve on, and I would really appreciate anyone’s feedback or suggestions on how I can improve!
Github Repo
Project Page
submitted by /u/tylertaewook
[link] [comments]
( 85
min )
submitted by /u/rikusorasephiroth
[link] [comments]
( 82
min )
submitted by /u/walt74
[link] [comments]
( 82
min )
submitted by /u/LordPewPew777
[link] [comments]
( 83
min )
submitted by /u/Available_Tadpole829
[link] [comments]
( 82
min )
submitted by /u/mr_j_b
[link] [comments]
( 83
min )
submitted by /u/mr_j_b
[link] [comments]
( 83
min )
submitted by /u/VIRUS-AOTOXIN
[link] [comments]
( 82
min )
Tensors are an effective method for handling and representing multidimensional data arrays. However, they have a limitation in terms of storage and computation. Tensor decompositions are crucial in machine learning because they factorize the weights of neural networks. This research introduces tntorch, an open-source python package for tensor learning that supports several decompositions through a single user interface. In contrast to the state-of-the-art packages, tntorch emphasizes an easy-to-use, decomposition-independent interface inherited from PyTorch.
🚦 An open-source python package for tensor learning that supports several decompositions through a single user interface
🚦 In contrast to the state-of-the-art packages, tntorch emphasizes an easy-to-use, decomposition-independent interface inherited from PyTorch
🚦 Several decomposition models that are crucial in machine learning, such as CANDEDOMP/ PARAFAC (CP), the Tucker decomposition, and the tensor train (TT), is supported by tntorch
🚦 It gives machine learning access to the power of low-rank tensor decompositions while maintaining the excellent appearance and feel of PyTorch tensors
Continue reading | Checkout the paper and github
submitted by /u/ai-lover
[link] [comments]
( 84
min )
submitted by /u/nick7566
[link] [comments]
( 83
min )
An in-depth analysis about regulations for AI in medical devices.
( 19
min )
Hyperparameter optimization is a big part of deep learning. The reason is that neural networks are notoriously difficult to configure and there are a lot of parameters that need to be set. On top of that, individual models can be very slow to train. In this post you will discover how you can use the grid […]
The post How to Grid Search Hyperparameters for Deep Learning Models in Python With Keras appeared first on Machine Learning Mastery.
( 172
min )
AI Weirdness: the strange side of machine learning
( 2
min )
Anomalib is Machine Library developed by AI researchers from Intel which implements state of the art algorithms for anomaly detection. Anomaly detection is popular use case in the industrial sector and such algorithms can help provide real-time feedback to manufactures on how well their production lines are performing.
Anomaly Detection is a challenging problem often due to a biased dataset. Anomalous images can be scare therefore these algorithms are trained on good images in an unsupervised fashion. By learning the normality, upon inference, the models can detect whether images are anomalous or not.
Anomalib was built using a PyTorchLightning Backbone and offers an easy way to deploy the models with OpenVino for inference speedup.
Link to the github repo: https://github.com/openvinotoolkit/anomalib
Link to a tutorial on how to train your custom dataset with anomalib: https://github.com/openvinotoolkit/anomalib/tree/development/docs/blog/001-train-custom-dataset
Please feel free to check out the repo and give us your feedback
submitted by /u/alder-ice
[link] [comments]
( 87
min )
submitted by /u/Available_Tadpole829
[link] [comments]
( 84
min )
submitted by /u/LeglessLoach
[link] [comments]
( 83
min )
submitted by /u/cold-depths
[link] [comments]
( 85
min )
submitted by /u/OneFinding1429
[link] [comments]
( 84
min )
submitted by /u/estasfuera
[link] [comments]
( 85
min )
submitted by /u/estasfuera
[link] [comments]
( 85
min )
submitted by /u/bperki8
[link] [comments]
( 85
min )
submitted by /u/Hallowmew
[link] [comments]
( 86
min )
submitted by /u/Various_Yoghurt1859
[link] [comments]
( 84
min )
submitted by /u/NinaMJ
[link] [comments]
( 84
min )
submitted by /u/MufBoiLegend420
[link] [comments]
( 85
min )
submitted by /u/chelsea_bear
[link] [comments]
( 85
min )
submitted by /u/imapurplemango
[link] [comments]
( 85
min )
You can access Amazon SageMaker Studio notebooks from the Amazon SageMaker console via AWS Identity and Access Management (IAM) authenticated federation from your identity provider (IdP), such as Okta. When a Studio user opens the notebook link, Studio validates the federated user’s IAM policy to authorize access, and generates and resolves the presigned URL for […]
( 6
min )
In part 1 of this series, we demonstrated how to resolve an Amazon SageMaker Studio presigned URL from a corporate network using Amazon private VPC endpoints without traversing the internet. In this post, we will continue to build on top of the previous solution to demonstrate how to build a private API Gateway via Amazon API […]
( 7
min )
Some things are easy as A, B, C. But when it comes to autonomous vehicles, the key may be in one, two, three. Faction, a Bay Area-based startup and NVIDIA Inception member, is preparing to debut its business-to-business autonomous delivery service, accelerating its commercial deployment with three-wheel production electric vehicles purpose-built for driverless services. In Read article >
The post Three Wheeling: Startup Faction Develops Affordable Tri-Wheel AVs on NVIDIA DRIVE appeared first on NVIDIA Blog.
( 5
min )
Turn the TV on. GeForce NOW is leveling up gaming in the living room. The Samsung Gaming Hub launched today, delivering GeForce NOW natively on 2022 Samsung Smart TVs. Plus, the SHIELD Software Experience Upgrade 9.1 is now rolling out to all NVIDIA SHIELD TVs, delivering new gaming features that improve GeForce NOW. Great living Read article >
The post The Gaming Evolution Will Be Televised: GFN Thursday Levels Up the Living Room Experience on New Samsung TVs and More appeared first on NVIDIA Blog.
( 8
min )
Blog post miniseries summarizing byteLAKE’s recommendation about hardware platforms to perform CFD Suite’s AI Training at the Edge.
( 15
min )
Researchers develop tools to help data scientists make the features used in machine-learning models more understandable for end users.
( 7
min )
submitted by /u/laul_pogan
[link] [comments]
( 83
min )
submitted by /u/getrich_or_diemining
[link] [comments]
( 83
min )
submitted by /u/BB4evaTB12
[link] [comments]
( 82
min )
submitted by /u/much_successes
[link] [comments]
( 82
min )
submitted by /u/lucapiccinelli
[link] [comments]
( 82
min )
submitted by /u/datapablo
[link] [comments]
( 82
min )
submitted by /u/Peter909098
[link] [comments]
( 82
min )
RStudio on Amazon SageMaker is the industry’s first fully managed RStudio Workbench in cloud. You can quickly launch the familiar RStudio integrated development environment (IDE), and dial up and down the underlying compute resources without interrupting your work, making it easy to build machine learning (ML) and analytics solutions in R at scale. RStudio on […]
( 11
min )
Online conversations are ubiquitous in modern life, spanning industries from video games to telecommunications. This has led to an exponential growth in the amount of online conversation data, which has helped in the development of state-of-the-art natural language processing (NLP) systems like chatbots and natural language generation (NLG) models. Over time, various NLP techniques for […]
( 11
min )
Large attention-based transformer models have obtained massive gains on natural language processing (NLP). However, training these gigantic networks from scratch requires a tremendous amount of data and compute. For smaller NLP datasets, a simple yet effective strategy is to use a pre-trained transformer, usually trained in an unsupervised fashion on very large datasets, and fine-tune […]
( 7
min )
With the growth in adoption of online applications and the rising number of internet users, digital fraud is on the rise year over year. Amazon Fraud Detector provides a fully managed service to help you better identify potentially fraudulent online activities using advanced machine learning (ML) techniques, and more than 20 years of fraud detection […]
( 17
min )
If you’ve looked at Keras models on Github, you’ve probably noticed that there are some different ways to create models in Keras. There’s the Sequential model which allows you to define an entire model in a single line, usually with some line breaks for readability, then there’s the functional interface that allows for more complicated […]
The post Three Ways to Build Machine Learning Models in Keras appeared first on Machine Learning Mastery.
( 24
min )
Silicon Valley magic met Wednesday with 175 years of industrial technology leadership as Siemens CEO Roland Busch and NVIDIA Founder and CEO Jensen Huang shared their vision for an “industrial metaverse” at the launch of the Siemens Xcelerator business platform in Munich. “When we combine the real and digital worlds we can achieve new levels Read article >
The post The Metaverse Goes Industrial: Siemens, NVIDIA Extend Partnership to Bring Digital Twins Within Easy Reach appeared first on NVIDIA Blog.
( 8
min )
NVIDIA and its partners continued to provide the best overall AI training performance and the most submissions across all benchmarks with 90% of all entries coming from the ecosystem, according to MLPerf benchmarks released today. The NVIDIA AI platform covered all eight benchmarks in the MLPerf Training 2.0 round, highlighting its leading versatility. No other Read article >
The post NVIDIA, Partners Show Leading AI Performance and Versatility in MLPerf appeared first on NVIDIA Blog.
( 7
min )
The June NVIDIA Studio Driver is available for download today, optimizing the latest creative app updates, all with the stability and reliability that users count on. Creators with NVIDIA RTX GPUs will benefit from faster performance and new features within Blender version 3.2, BorisFX Sapphire release 2022.5 and Topaz Denoise AI 3.7.0.
The post NVIDIA Studio Driver Elevates Creative Workflows in Blender 3.2, BorisFX Sapphire and Topaz Denoise AI appeared first on NVIDIA Blog.
( 7
min )
submitted by /u/tohelpyou88
[link] [comments]
( 83
min )
submitted by /u/lucapiccinelli
[link] [comments]
( 83
min )
Addressing and mitigating the effects of climate change requires a collective effort, bringing our strengths to bear across industry, government, academia, and civil society.
The post Introducing the Microsoft Climate Research Initiative appeared first on Microsoft Research.
( 10
min )
submitted by /u/LordPewPew777
[link] [comments]
( 82
min )
submitted by /u/henlo_there_fren
[link] [comments]
( 83
min )
submitted by /u/PineappleTreePro
[link] [comments]
( 82
min )
I read a super interesting KDD 2022 paper recently - "Learning Backward Compatible Embeddings".
The paper tackles a common industry problem of ensuring compatibility of newer embeddings with an older downstream model.
An annotated version of the paper - Annotated-ML-Papers/Learning Backward Compatible Embeddings.pdf
submitted by /u/shreyansh26
[link] [comments]
( 82
min )
submitted by /u/VictorTuring
[link] [comments]
( 83
min )
submitted by /u/VictorTuring
[link] [comments]
( 82
min )
submitted by /u/HumanSeeing
[link] [comments]
( 83
min )
submitted by /u/CALP_is_holy
[link] [comments]
( 82
min )
submitted by /u/bartturner
[link] [comments]
( 82
min )
submitted by /u/_ayushp_
[link] [comments]
( 86
min )
submitted by /u/Available_Tadpole829
[link] [comments]
( 83
min )
How do pro-choice vs. pro-life twitter users differ?
I built a free, labeled dataset of #RoeVsWade tweets, and an ML classifier on top.
Some insights:
Pro-life users are 20.4x more likely to put "christ" and 16.1x more likely to put "maga" in their bio.Pro-choice users are 7.5x more likely to put "blm" and 6.5x more likely to put "she/her".
Full analysis + link to raw dataset here.
submitted by /u/BB4evaTB12
[link] [comments]
( 84
min )
I read a super interesting KDD 2022 paper recently - "Learning Backward Compatible Embeddings".
The paper tackles a common industry problem of ensuring compatibility of newer embeddings with an older downstream model.
An annotated version of the paper - Annotated-ML-Papers/Learning Backward Compatible Embeddings.pdf
submitted by /u/shreyansh26
[link] [comments]
( 84
min )
Transformers are awesome for so many things in 2022, but one thing I've found them to struggle with is generating embeddings for long documents.
I put together a blog post going through some interesting techniques. Let me know if it helped you!
Blog post
submitted by /u/BlockDesigns
[link] [comments]
( 84
min )
Just released. Quaterion — an open source framework for training and fine-tuning similarity learning models. It enables you to train models significantly (100x) faster, and iterate over experiments in minutes instead of hours even with a laptop GPU. It takes advantage of the PyTorch Lightning backend to make a flexible and scalable learning pipeline. GitHub https://github.com/qdrant/quaterion
Here is a demo of the caching functionality.
https://i.redd.it/9qi8gf9n4d891.gif
submitted by /u/devzaya
[link] [comments]
( 84
min )
Developers can use RestifyML to
Create DataScience experiments
Create Data Source and upload CSV data within the experiment
Do Data Cleansing and Sanitization
Visualize raw data using Data Exploration
Select Features which would help in building models
Build Model, save or export them
Finally, deploy Model and expose them as REST API
Consume Machine Learning REST API from any Application
Profit!
https://github.com/rebataur/RestifyML
Feedback/ Feature Request appreciated.
submitted by /u/rebataur
[link] [comments]
( 85
min )
submitted by /u/pcaversaccio
[link] [comments]
( 86
min )
Jupyter Notebook: https://colab.research.google.com/drive/1BsVkddtVMX35aZAvo2GyI-wSFPVBCWuA
Github: https://github.com/facebookresearch/torchdim
Some tweet threads about it
Mine: https://twitter.com/cHHillee/status/1541536627746426881
Sasha Rush: https://twitter.com/srush_nlp/status/1541526906113298433
submitted by /u/programmerChilli
[link] [comments]
( 84
min )
The second AI Policy Forum Symposium convened global stakeholders across sectors to discuss critical policy questions in artificial intelligence.
( 7
min )
submitted by /u/gwern
[link] [comments]
( 83
min )
submitted by /u/AnimoIsland
[link] [comments]
( 83
min )
Amazon Polly is a leading cloud-based service that converts text into lifelike speech. Following the adoption of Neural Text-to-Speech (NTTS), we have continuously expanded our portfolio of available voices in order to provide a wide selection of distinct speakers in supported languages. Today, we are pleased to announce four new additions: Pedro speaking US Spanish, […]
( 5
min )
Amazon SageMaker provides a suite of built-in algorithms, pre-trained models, and pre-built solution templates to help data scientists and machine learning (ML) practitioners get started on training and deploying ML models quickly. You can use these algorithms and models for both supervised and unsupervised learning. They can process various types of input data, including tabular, […]
( 7
min )
In computer vision, semantic segmentation is the task of classifying every pixel in an image with a class from a known set of labels such that pixels with the same label share certain characteristics. It generates a segmentation mask of the input images. For example, the following images show a segmentation mask of the cat […]
( 9
min )
Every business needs the ability to predict the future accurately in order to make better decisions and give the company a competitive advantage. With historical data, businesses can understand trends, make predictions of what might happen and when, and incorporate that information into their future plans, from product demand to inventory planning and staffing. If […]
( 10
min )
Taiwan has nearly 85,000 kidney dialysis patients — the highest prevalence in the world based on population density. Taipei Veterans General Hospital (TVGH) is working to improve outcomes for these patients with an AI model that predicts heart failure risk in real time during dialysis procedures. Cardiovascular disease is the leading cause of death for Read article >
The post Detect to Protect: Taiwan Hospital Deploys Real-Time AI Risk Prediction for Kidney Patients appeared first on NVIDIA Blog.
( 7
min )
Moderation is the process of controlling the wanted contents from the online platforms like social media networking sites. And it is…
( 8
min )
“Homer Simpson reacting to the crash of Bitcoin”
Continue reading on Becoming Human: Artificial Intelligence Magazine »
( 10
min )
submitted by /u/rikusorasephiroth
[link] [comments]
( 83
min )
submitted by /u/trcytony
[link] [comments]
( 82
min )
submitted by /u/Diana-RS
[link] [comments]
( 82
min )
submitted by /u/regalalgorithm
[link] [comments]
( 83
min )
submitted by /u/regalalgorithm
[link] [comments]
( 82
min )
submitted by /u/Diana-RS
[link] [comments]
( 82
min )
View the tutorial here: HERE
This tutorial teaches you how to convert any text prompt to an image using VQGAN-Clip.
For example you could use the prompt "A spray painting of a waiting computer and a bedroom in the style of Edgar Degas and Art Nouveau".
This would generate the following image:
https://imgur.com/J3qGlc4
Let me know if you have any questions or comments.
submitted by /u/mshriver2
[link] [comments]
( 83
min )
submitted by /u/bendee983
[link] [comments]
( 83
min )
submitted by /u/justine01923
[link] [comments]
( 83
min )
submitted by /u/bartturner
[link] [comments]
( 82
min )
submitted by /u/AChickenInAHole
[link] [comments]
( 82
min )
submitted by /u/PedroRibs
[link] [comments]
( 84
min )
With a potential recession lurking on the horizon, 99% of companies will make the same old “safe” mistakes: hunker down, let people go, shrink, and hope to hold on for dear life. However, growth-oriented organizations will see this as a business opportunity – an opportunity to leverage their data to “do more with less”. You… Read More »Data & Analytics Regression Playbook: Make Your Data Work Harder…And Smarter!
The post Data & Analytics Regression Playbook: Make Your Data Work Harder…And Smarter! appeared first on Data Science Central.
( 22
min )
submitted by /u/s7v7nsilver
[link] [comments]
( 84
min )
We're used to finding that task performance scales well with large increases in sizes of language models. But for real-world applications, it's also very meaningful to search for failure cases preemptively to fix the underlying issues. Can you find and convincingly demonstrate these failure cases where language models scale inversely, with larger models behaving worse?
You don't necessarily need to have extra deep knowledge of ML or language models in order to participate and win, because all models are frozen and you only need to come up with the right data.
Check out these resources to learn more! Announcement Twitter thread, contest details on Github. The deadline for the first round of the contest is August 27, 2022.
submitted by /u/alexlyzhov
[link] [comments]
( 87
min )
Surprised I haven't seen more chatter about this. What do you think about Nvidia's instant Nerf which turns 2d into 3d based on these techniques https://arxiv.org/abs/2003.10016
Does the output of a NeRF give a depth map that's comparable to what you'd get from a Kinect?
Can these be used to create 3D models one would use in Unreal or Blender?
submitted by /u/KalloDotIO
[link] [comments]
( 84
min )
Hey r/MachineLearning! We are the co-founders of Stack, a hub for data collaboration and versioning. We are developing this tool to help ML teams automatically track changes in their data seamlessly.
We are opening a waiting list for our beta, which we aim to release soon. You can sign up at: https://www.getstack.ai/
We are also actively looking for feedback. Feel free to share any comments or thoughts!
submitted by /u/baceituno
[link] [comments]
( 84
min )
I am trying to train a latent-diffusion model by following the instructions on the repo, however I am running into errors while sampling from the checkpointed models. Can someone help?
I am getting Errors while trying to sample using sample_diffusion.py from a custom model trained on LSUN churches
submitted by /u/icelebratefestivus
[link] [comments]
( 85
min )
We usually use TensorFlow to build a neural network. However, TensorFlow is not limited to this. Behind the scene, TensorFlow is a tensor library with automatic differentiation capability. Hence we can easily use it to solve a numerical optimization problem with gradient descent. In this post, we are going to show how TensorFlow’s automatic differentiation […]
The post Using autograd in TensorFlow to Solve a Regression Problem appeared first on Machine Learning Mastery.
( 16
min )
View the tutorial here: HERE
This tutorial teaches you how to convert any text prompt to an image using VQGAN-Clip.
For example you could use the prompt "A spray painting of a waiting computer and a bedroom in the style of Edgar Degas and Art Nouveau".
This would generate the following image:
https://imgur.com/J3qGlc4
Let me know if you have any questions or comments.
submitted by /u/mshriver2
[link] [comments]
( 83
min )
Object localization trained from scratch for emoji dataset in TensorFlow 2.8. Getting an IoU = 0.5969 and classification output accuracy = 100%. The code can be referred here. Though in fairness, I am using only 9 classes out of the emoji dataset. Thoughts?
submitted by /u/grid_world
[link] [comments]
( 82
min )
submitted by /u/Plazmeer
[link] [comments]
( 82
min )
submitted by /u/disdisinform
[link] [comments]
( 85
min )
Launched at AWS re:Invent 2021, Amazon SageMaker Ground Truth Plus helps you create high-quality training datasets by removing the undifferentiated heavy lifting associated with building data labeling applications and managing the labeling workforce. All you do is share data along with labeling requirements, and Ground Truth Plus sets up and manages your data labeling workflow […]
( 6
min )
submitted by /u/AquaHug
[link] [comments]
( 85
min )
submitted by /u/markurtz
[link] [comments]
( 84
min )
submitted by /u/_ayushp_
[link] [comments]
( 90
min )
submitted by /u/Illustrious_Row_9971
[link] [comments]
( 84
min )
maybe you haven’t heard this, this is voice https://youtu.be/FAvcn_8OuMk this sound is really good. im wondering if anyone knows which al is used for text to voice?
submitted by /u/Basic_Pay7859
[link] [comments]
( 82
min )
submitted by /u/dulldata
[link] [comments]
( 82
min )
submitted by /u/OnlyProggingForFun
[link] [comments]
( 82
min )
submitted by /u/henlo_there_fren
[link] [comments]
( 82
min )
submitted by /u/urocyon_dev
[link] [comments]
( 84
min )
submitted by /u/nick7566
[link] [comments]
( 82
min )
submitted by /u/IntelligentHat1657
[link] [comments]
( 82
min )
submitted by /u/Available_Tadpole829
[link] [comments]
( 82
min )
submitted by /u/Shaftershafter
[link] [comments]
( 82
min )
submitted by /u/gwern
[link] [comments]
( 83
min )
submitted by /u/gwern
[link] [comments]
( 82
min )
submitted by /u/gwern
[link] [comments]
( 84
min )
submitted by /u/cityofgoul
[link] [comments]
( 82
min )
submitted by /u/Available_Tadpole829
[link] [comments]
( 82
min )
submitted by /u/getrich_or_diemining
[link] [comments]
( 82
min )
submitted by /u/the_anonymizer
[link] [comments]
( 82
min )
submitted by /u/binaryfor
[link] [comments]
( 82
min )
submitted by /u/LordPewPew777
[link] [comments]
( 82
min )
submitted by /u/nick7566
[link] [comments]
( 84
min )
submitted by /u/niIbert
[link] [comments]
( 82
min )
submitted by /u/nalr00n
[link] [comments]
( 82
min )
We are happy to invite you to the Hugging Face Gradio CVPR event - a community event in which we will create interactive demos for CVPR papers. Demos are powerful because they allow anyone — not just ML engineers — to try out models in the browser, give feedback on predictions, identify trustworthy models. The event is open until June 30th, 2022 (AOE Time Zone). We are organizing this event on Huggingface: https://huggingface.co/CVPR. Prizes will be given at the end of the event.
Demos will be built with Gradio and we encourage using the new Gradio Blocks API. Blocks allows you to build web-based demos in a flexible way using the Gradio library. Gradio is a popular choice for building demos for machine learning models, as it allows you to create web-based UIs all in Python. For example, here is a Gradio Demo for FLAVA: A Foundational Language And Vision Alignment Model:
https://reddit.com/link/vkqmhu/video/48cnmkfiku791/player
submitted by /u/Illustrious_Row_9971
[link] [comments]
( 84
min )
submitted by /u/Just_Ad8110
[link] [comments]
( 84
min )
Data annotation is at the forefront of the recent revolution in healthcare AI, driving continuous progress in the field through continuous innovation through the idea of Artificial Intelligence. A computer program can use human intelligence to perform many tasks that humans carry out today. The concept is called artificial intelligence (AI). Finding tumors, discovering kidney… Read More »Datasets and Data Annotation — The Building Blocks for Healthcare AI
The post Datasets and Data Annotation — The Building Blocks for Healthcare AI appeared first on Data Science Central.
( 22
min )
submitted by /u/urocyon_dev
[link] [comments]
( 83
min )
submitted by /u/tohelpyou88
[link] [comments]
( 82
min )
submitted by /u/gwern
[link] [comments]
( 82
min )
submitted by /u/estasfuera
[link] [comments]
( 82
min )
submitted by /u/Available_Tadpole829
[link] [comments]
( 82
min )
submitted by /u/DaveBowman1975
[link] [comments]
( 82
min )
submitted by /u/OnlyProggingForFun
[link] [comments]
( 82
min )
submitted by /u/ai_jobs
[link] [comments]
( 82
min )
Today, we’re excited to announce that Amazon Forecast offers the ability to generate forecasts on a selected subset of items. This helps you to leverage the full value of your data, and apply it selectively on your choice of items reducing the time and effort to get forecasted results. Generating a forecast on ‘all’ items of the […]
( 5
min )
The content and opinions in this post are those of the third-party author and AWS is not responsible for the content or accuracy of this post. As more organizations use deep learning techniques such as computer vision and natural language processing, the machine learning (ML) developer persona needs scalable tooling around experiment tracking, lineage, and […]
( 8
min )
This blog post is co-authored by Guillermo Ribeiro, Sr. Data Scientist at Cepsa. Machine learning (ML) has rapidly evolved from being a fashionable trend emerging from academic environments and innovation departments to becoming a key means to deliver value across businesses in every industry. This transition from experiments in laboratories to solving real-world problems in […]
( 9
min )
In a previous post, we talked about analyzing and tagging assets stored in Veeva Vault PromoMats using Amazon AI services and the Veeva Vault Platform’s APIs. In this post, we explore how to use Amazon AppFlow, a fully managed integration service that enables you to securely transfer data from software as a service (SaaS) applications […]
( 12
min )
As enterprise businesses embrace machine learning (ML) across their organizations, manual workflows for building, training, and deploying ML models tend to become bottlenecks to innovation. To overcome this, enterprises needs to shape a clear operating model defining how multiple personas, such as data scientists, data engineers, ML engineers, IT, and business stakeholders, should collaborate and […]
( 18
min )
We are excited to announce Amazon CodeWhisperer, a machine learning (ML)-powered service that helps improve developer productivity by providing code recommendations based on developers’ natural comments and prior code. With CodeWhisperer, developers can simply write a comment that outlines a specific task in plain English, such as “upload a file to S3.” Based on this, […]
( 6
min )
Running machine learning (ML) experiments in the cloud can span across many services and components. The ability to structure, automate, and track ML experiments is essential to enable rapid development of ML models. With the latest advancements in the field of automated machine learning (AutoML), namely the area of ML dedicated to the automation of […]
( 6
min )
Over recent years, web scraping has become an incredibly popular practice, the rise of this field being largely attributed to the vast amounts of data that are produced and distributed every single day.
The post 5 Most Common Use Cases for Web Scraping appeared first on Data Science Central.
( 24
min )
You may not know of Todd Mozer, but it’s likely you have experienced his company: It has enabled voice and vision AI for billions of consumer electronics devices worldwide. Sensory, started in 1994 from Silicon Valley, is a pioneer of compact models used in mobile devices from the industry’s giants. Today Sensory brings interactivity to Read article >
The post Finding NeMo: Sensory Taps NVIDIA AI for Voice and Vision Applications appeared first on NVIDIA Blog.
( 5
min )
To foster climate action for a healthy global environment, NVIDIA is working with the United Nations Satellite Centre (UNOSAT) to apply the powers of deep learning and AI. The effort supports the UN’s 2030 Agenda for Sustainable Development, which has at its core 17 interrelated Sustainable Development Goals. These SDGs — which include “climate action” Read article >
The post UN Satellite Centre Works With NVIDIA to Boost Sustainable Development Goals appeared first on NVIDIA Blog.
( 5
min )